A Database for Handwriting Recognition Research in Sinhala Language

نویسندگان

  • H. C. Fernando
  • N. D. Kodikara
  • Sanjika Hewavitharana
چکیده

This article presents a database of images of handwritten city names. The aim is to provide a standard database for Sinhala handwriting recognition research. This database contains about 15,000 images of about 500 city names of Sri Lanka. These images are obtained from the addresses of live mail so that the writers had no idea that they would be used for this purpose. Also, these are unconstrained handwriting images unlike the images collected using prescribed forms in laboratory environment. The images are divided into two groups, training set and testing set. This enables the comparison of results of different researches and serves the purpose of being a standard database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thresholding, Noise Reduction and Skew correction of Sinhala Handwritten Words

The Sinhala script, which is generally with round characters, is unique among other Brahmi-descended scripts and is used by 70% of the 18 million populations in Sri Lanka. There has been no published research on the cursive unconstrained Sinhala handwriting recognition. This paper proposes vital preprocessing stages, which are categorized under thresholding, noise removal, skew detection and co...

متن کامل

Off-Line Sinhala Handwriting Recognition Using Hidden Markov Models

This paper describes a method to recognize off-line handwritten Sinhala characters, the language used by the majority of Sri Lanka. The classification approach is based on discrete hidden Markov models. A subset of the Sinhala alphabet was chosen for the study. The unknown characters are first pre-classified into one of three character groups, based on the structural properties of the text line...

متن کامل

Isolated Persian/Arabic handwriting characters: Derivative projection profile features, implemented on GPUs

For many years, researchers have studied high accuracy methods for recognizing the handwriting and achieved many significant improvements. However, an issue that has rarely been studied is the speed of these methods. Considering the computer hardware limitations, it is necessary for these methods to run in high speed. One of the methods to increase the processing speed is to use the computer pa...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

AltecOnDB: A Large-Vocabulary Arabic Online Handwriting Recognition Database

Arabic is a semitic language characterized by a complex and rich morphology. The exceptional degree of ambiguity in the writing system, the rich morphology, and the highly complex word formation process of roots and patterns all contribute to making computational approaches to Arabic very challenging. As a result, a practical handwriting recognition system should support large vocabulary to pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003